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ABSTRACT 

We measure the angular two-point correlation function of sub-millimeter 
galaxies (SMGs) from 1.1-millimeter imaging of the COSMOS field with the 
AzTEC camera and ASTE 10-meter telescope. These data yields one of the 
largest contiguous samples of SMGs to date, covering an area of 0.72 degrees^ 
down to a 1.26 mJy/beam (1-cr) limit, including 189 (328) sources with S/N> 3.5 
(3). We can only set upper limits to the correlation length ro, modeling the 
correlation function as a power-law with pre-assigned slope. Assuming existing 
redshift distributions, we derive 68.3% confidence level upper limits of Tq < 6- 
8 h~^Mpc at 3.7 mJy, and ro < 11-12 h~^Mpc at 4.2 mJy. Although consistent 
with most previous estimates, these upper limits imply that the real rg is likely 
smaller. This casts doubts on the robustness of claims that SMGs are charac- 
terized by significantly stronger spatial clustering, (and thus larger mass), than 
differently selected galaxies at high- redshift. Using Monte Carlo simulations we 

^Astronomy Department, University of Massachusetts, 710 North Pleasant Street, Amherst, MA 01003, 
USA, ccwillia@astro.umass.edu 

^Argelander-Institut fiir Astronomic der Universitat Bonn, Auf dem Hiigel 71, D-53121 Bonn, Germany 

•^Department of Physics and Astronomy, University of Pennsylvania, 209 South 33rd Street, Philadelphia, 
PA 19104, USA 

''Center for Astrophysics and Space Astronomy, University of Colorado, Boulder, CO 80309, USA 

^Instituto Nacional de Astrofisica, Optica y Electronica (INAOE), Aptdo. Postal 51 y 216, 72000 Puebla, 
Pue., Mexico 

^Nobeyama Radio Observatory, Minamimaki, Minamisaku, Nagano 384-1805, Japan 

^Yale Center for Astronomy and Astrophysics, Department of Physics, Yale University, New Haven, CT 
06520, USA 

^Institute of Astronomy, the University of Tokyo, 2-21-1 Osawa, Mitaka, Tokyo 181-0015, Japan 
^Department of Physics, Illinois Wesleyan University, Bloomington, IL 61701, USA 



-2- 



show that even strongly clustered distributions of galaxies can appear unclustered 
when sampled with limited sensitivity and coarse angular resolution common to 
current sub-millimeter surveys. The simulations, however, also show that un- 
clustered distributions can appear strongly clustered under these circumstances. 
From the simulations, we predict that at our survey depth, a mapped area of 
two degrees^ is needed to reconstruct the correlation function, assuming smaller 
beam sizes of future surveys (e.g. the Large Millimeter Telescope's 6" beam size). 
At present, robust measures of the clustering strength of bright SMGs appear to 
be below the reach of most observations. 

Subject headings: Galaxies: evolution. Galaxies: high-redshift, large-scale struc- 
ture of Universe, Sub-millimeter: galaxies 



Introduction 



High-redshift galaxies which are relatively bright at millimeter and sub-millimeter wave- 
lengths, and thus detectable by current ground-based instrumentation, have, over the last 
decade, come to the forefront of studies of galaxy evolution. Commonly referred to as 
sub- millimeter galaxies (SMGs), because the first significant deep surveys have been made 
at A = 450 and 850 /xm, these sources are thought to be largely obscured by dust, with 
star-formation rates of up to 1000 Mc^yeax~^ needed to power their high rest-frame infrare d 
luminosity of L/r ~ IQ^'^-W^Lq JSmail et al.lll997l : lHughes et al.lllQQsHBarger et al.lll998h . 
It has long been speculated that with such high luminosity and star formation rates, SMGs 
should be very m a ssive, strongly cluste red, and trace large-scale structure at high redshift 
(IBlain et al.l |2004J : lAmblard et al.l 120111 ). If it is conclusively found that SMGs do cluster 
strongly in space, and therefore trace massive dark matter halos at high redshift, this will 
provide additional evidence that these sources are evolutionarily linked to massive elliptical 



galaxies often fou nd in the center of galaxy clusters in the local universe (ILilly et al 



1999 



Eales et al.lll999l ). Hence, robust determination of the clustering strength of the SMGs, at 
least of those that are commonly detected with current instrumentation, namely with flux 
brighter than a few mJy , would have strong implications in theories of galaxy evolution 
(Ivan Kampen et al.ll2005l : iNegrello et al.ll2007l ). as it is not well understood what observable 
properties of galaxies are characteristic of biased tracers of the background dark matter 
distribution. 



Until recently, a secure measurement of SMG cluster ing has been elusive (IWebb et al. 



20031 : iBlain et al.ll2004j : IScott et al.ll2006l : IWeifi et al.ll2009[ ). in large part because of the slow 
mapping speeds of sub-millimeter instruments, whose maps have been very small in area. 
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Recently, data at 250-500/im from the Herschel Space Telescope, has produced improvements 
in terms of the area of su b-millimeter raaps, allowing clustering rn easurements to be made 
with improved statistics ( iMaddox et al.l l2010t ICooray et al.l 120101 ) . The Herschel surveys, 
however, are biased to low redshift and low luminosity galaxies as a result of the bluer 
wavelengths that they cover and of the negative k-correction; they also still suffer from source 
confusion despite their large area. Clustering measurements at longer wavelengths, on the 
other hand, still remains uncertain. Sub-millimeter surveys are still limited by large beam 
size and shallow survey depths which have prevented robust positions and large sample sizes. 
As sub-millimeter maps become larger with higher resolution, and the number of securely 
detected sources becomes statistically significant, studies of clustering of these SMGs will 
surely provide interesting implications for galaxy evolution. 

Here we present measures of the angular clustering of SMGs from one of the first mil- 
limeter maps containing a statistically significant number of SM Gs. This map cover s a 1 
square degree section of the Cosmic Evolution Survey (COSMOS; IScoville et al.l 120071 ) field 
using the AzTEC bolometer array, mounted on the Atacama Sub-millimeter Telescope Ex- 
periment (ASTE). The data provide the largest contiguous map, and the largest galaxy 
sample, at 1.1 millimeters to date (Aretxaga et al., in prep). While spectroscopic redshift 
information on SMGs remains sparse, we have used various redshift distributions to estimate 
de-projected spatial clustering for these galaxies. We assume a cosmology with Q\ = 0.7, 

= 0.3, and Ho = 100/i km Mpc~\ 



2. Observations 

The COSMOS field was mapped with AzTEC on ASTE, a full description and results 
will be presented in a separate publication (Aretxaga et al., in prep.). We imaged a subset 
of the COSMOS blank field centered at {RA, Dec) = (150.125, 2.23) with a total area of 1.41 
degrees^, totaling 112.6 hours of observing time. With AzTEC on ASTE, the beam size is 
28" (full- width half max). For this analysis we have considered only the region of the map 
where the coverage was 50% of the maximum value or higher. This results in a contiguous 
map of 0.72 degrees^. We achieve an average noise level of 1.26 mJy/beam, which is very 
uniform throughout the area considered, ranging from 1.23 to 1.27 mJy/beam. 

Our millimeter sources are selected by searching for peaks a bove a given sign al-to-noise 



ratio (S/N) with a window corresponding to the beam size (e.g. IScott et al.ll2008l ). We find 
328 sources with a S/N above 3.0, and 189 sources with a S/N above 3.5, hereafter the 3.0-O" 
and 3.5-cr catalogs, respectively. The map and source positions are shown in Figure [TJ 
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1.1 mm sources in AzTEC/COSMOS 




150.6 150.4 150.2 150.0 149.8 
Right Ascension (degrees) 



Fig. 1. — The 50% coverage region in the AzTEC/COSMOS map. 3.0 to 3.5-cr sources are 
circled in black, >3.5-a sources are circled in white. Circle size corresponds to one and a 
half times the beam size. 

3. Clustering Analysis 

3.1. Angular Clustering 

The angular two-point correlation function, w{6), measures the excess probability, above 
that expected for a random distribution, of finding two galaxies with an angular separation 
6, within a solid angle 6fl. It is defined in terms of the p robability 5P = iV^[l + w{6)]6Q, 
where N is the surface density of galaxies (jPeebles 1980 ). We measure angular clustering 
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of SMGs in the COSMOS field using the Landy-Szalay estimator of the angular correlation 



function (ACF; iLandy &: Szalaylll993l ). This can be measured as 



DD(9) - 2DR(9) + RRid) 

= me) ' 

where DD{9) is the number of observed galaxy pairs as a function of angular separation, 
6', DR{6) are the number of cross-pairs between the observed galaxies and a randomly 
distributed sample, and RR{6) is the number of randomly distributed pairs. The random 
distributions are generated by inserting randomly positioned sets of artificial sources into 
realizations of the noise distribution in our COSMOS map. The injected sources have a flux 
distribution based on our best estimate of SMC number counts from blank field observations 



(lAustermann et al.ll201Cll ). but we tuned the parameters such that the number of significant 
sources retrieved by our source finding algorithm are on average within 2% of the number 
of detections in the real map. We generate 100 of these simulations, and we use the random 
sources selected above the corresponding S/N threshold from each as random distributions. 
We have verified that (on average) the noise peaks in our COSMOS map and sources in the 
simulations are unclustered at the angular separations we consider (see Figure [2]). 

We expect our uncertainty to be dominated by small number statistics, but it is possi- 
ble that the map properties, such as non-uniformity and beam size, contribute to our error 
in measuring the ACF. So rather than assume Poisson error^ given by 6w{9) = ^^^^^^ 

( ILandy fc Szalaylll993[ ). which do not take these effects into account, we also quantify the 
uncertainty using the simulations. To do this we calculate the ACF of each of the 100 sim- 
ulated random catalogs, whose intrinsic ACF we know (on average we found that simulated 
sources are unclustered). Thus, the standard deviation of the ACF of the individual simu- 
lated catalogs, asim{e)i should include any error in w{0) related to the properties of the map. 
This uncertainty is given by 5w{6) = (1 + w{9))asim{e). We find the uncertainty obtained 
this way to be smaller than the Poissonian error for both catalogs, so we have conservatively 
assumed the latter. 

The difference between the 3.0- and 3.5-0" catalogs from the observed map is a trade 
off between a larger catalog of galaxies, and a lower false detection rate of sources in our 
observed map. By setting a high S/N threshold for detection we can achieve negligible 
false detections, but our number of sources would be small. We choose to select sources 
with a lower S/N threshold, while acknowledging that some fraction of them are not real 



^We found that Poisson errors are sometimes incorrectly used in the Uterature, as Sw{9) 



correct expression is 6w{0) 



^ i+w{e) 



iLandv fc Szalavlll993l) . 
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Fig. 2. — Averaged ACF for the peaks detected at 3.0-cr from the 100 noise reahzations 
(gray circles) and for the simulated sources detected at 3.0-a (black diamonds). Errors 
indicate standard deviation. Gray horizontal line corresponds to zero clustering. Slight anti- 
correlation around the 60" bin is a result of the beamsize; in general more random pairs 
with angular separations in this bin will be found since distances between detected sources 
are not smaller than twice the beamsize. 



SMGs. For the 3.5-0" catalog, we expect that about 9% of the sources are false detections, 
and for the 3.0-0" catalog, 24% are false detections (Aretxaga et al, in prep.). Including 
some fraction of randomly positioned non-galaxies will only serve to dilute our estimate of 
the clustering. As noise peaks are inherently unclustered, we can correct for this effect as 
Wofes(^) = (1 ^ fYwtrue{G) whcrc / is the fraction of false detections included in the catalog. 
Our ACF estimate, corrected for dilution by random false detections, is presented in Figure 
[31 We do not include in the analysis ACF measurements at angular separations smaller than 
twice the beam size. 

We assume that at the angular separations we are considering, the ACF behaves as a 
power law of the form w{6) = A^O'^ — IC , where we refer to as the clustering amplitude. 
IC refers to th e integ ral constraint correction, which we calculate using the algorithm of 



Roche &: Ealed (119991 ). The result of a power law fit where the clustering amplitude and 



- 7- 



0.6 
0.5 
0.4 
0.3 
0.2 
0.1 

-0.1 



3.5o sources 
besl fit 

68% confidence upper limit 
zero clustering 



100 1000 
Angular Separation 8 [arcsec] 



0.6 
0.5 
0.4 
0.3 
0.2 
0.1 


-0.1 



3. Oct sources 
Best fit 

68% confidence upper limit 
zero clustering 



100 1000 
Angular Separation 9 [arcsec] 



Fig. 3. — ACF for the two catalogs (points). The lower dot-dashed lines are the best fit 
power-law to the data, upper dotted lines are the 68.3% confidence level upper limits. Both 
power laws shown assume /3 = 0.8, and have their corresponding IC subtracted to match 
what is fit to the data. 



slope /3 are left as free parameters is poorly constrained, and the best slope is unphysically 
steep due to the fact that the measured ACF is high at the lowest angular scale. So given 
our large uncertainties, we do not attempt to constrain both clustering amplitude and slope. 
Instead, we assume two different representative values for (3. The first, /3 = 0.8, is observed 
for massive elliptical galaxies in large low redshift surveys and is the value typically as sumed 
for massive galaxies and SMGs at high redshift (jZehavi et al.ll2002l : iBlain et al.l 120041 ). The 
second value, /3 = 0.6, is the shallower slope typically observed for normal ultra-violet 



BX/BM galaxies 


(Giavalisco & Dickinson 


2001 


Adelbereer et al. 


2005 


), as well as starforming 



Porciani & Giavalisco 


2002; 


Lee et a 


2006 


galaxies at low redshift ( 


Zehavi et al. 


2002) 



Unless otherwise stated, in the text we will quote results derived using f3 = 0.8. 

In Figure |3] we also show the best fitting power laws (assuming (3 = 0.8) for each 
catalog found from a least squares minimization. Due to the large uncertainties, the best fit 
amplitudes are poorly constrained and the la upper limits to these best fit values are large. 
These upper limits are also shown in Figure [31 The case of zero clustering lies within the la 
error (defined as Ax^ < 1), but as negative values imply anti-correlation and are considered 
unphysical, we set zero clustering to be the lower limit. These best fits, 1-a upper error, 
and lower error (as described above) are = 3.7~^lj arcsecond°'^ for the 3.5-0" catalog 



and A,„ = l.l]*^?'! arcsecond^'^ for 3.0-cr, and are summarized in Table [T] along with results 



assuming f3 



-1.1 
0.6. 



While this analysis provides a best fit with confidence intervals, it does not limit a 
priori the possible range of values that A^ can assume. We want to explore the effect on the 
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Fig. 4. — Distributions in amplitudes for power law fits with slopes fixed to /3 = 0.8 and 
0.6 for 3.0- and 3.5-cr catalogs, generated by Monte Carlo simulations of the observed ACF. 
The vertical lines correspond to the 68.3% (smaller Ay^) and 99.5% (larger Ay^) limits in each 
distribution, and represent the largest clustering strengths allowable by our datasets. See 
Table 1 for corresponding values. 
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power law fit if we only consider positive values for A^, as negative values are unphysical. 
To do this we perform Monte Carlo simulations where we generate 5000 Gaussian deviated 
realizations of the observed ACF. The Gaussian deviates are generated using the Poisson 
error on each value of w{d). We fit each deviated realization with the same power law 
form outlined above to produce a distribution of best fitting clustering amplitudes, A^. The 
resulting distributions in A^, given assumed values of /3, are shown for each catalog in Figure 
m For both catalogs, the most likely values to be measured for A^ is zero, corresponding to 
the case where SMGs are unclustered. It must be emphasized that this does not mean that 
SMGs such as these are spatially unclustered, only that the strength of their clustering is 
below what is robustly detectable from our survey. The peak at zero is merely an effect of 
the likelihood of the distribution extending into the negative values of A^, since is 
poorly constrained. 

Using these distributions shown in Figure HJ we set upper limits to the power law 
amplitudes which are allowable given our measured ACF for SMGs, so our results can be 
compared with previous measurements of SMG clustering. These distributions are one-sided 
(because the peak lies at zero, the lowest value we allow for A^j), and so the distributions 
can only provide an upper limit. This is in contrast to the distribution which is two-sided 
and so provides an upper and lower limit (where conventionally the 68.3% confidence limits 
are given by Ax^ < 1). The two upper limits are different from each other in that the 68.3% 
confidence level upper limit from the minimization corresponds to a 15.85% probability 
of finding a larger A^, whereas the one-sided 68.3% confidence level upper limit from the 
Monte Carlo simulation corresponds to a 31.7% probability of finding a larger Ay^. We find 
that using these distributions from the Monte Carlo simulations we can reject power law 
models with amplitudes larger than Ay, = 2.4 arcsecond^'^ at the 68.3% confidence level, 
and Ay, = 7.9 arcsecond"'^ at the 99.5% confidence level for the 3.0-0" catalog, and Ay, = 5.2 
and Ay, = 11.5 arcsecond°-® at the 68.3% and 99.5% confidence levels, respectively, for the 
3.5-cr catalog. These results are shown as solid and dashed vertical lines in Figure [Hand are 
summarized in Table [TJ 



3.2. Spatial Clustering 



To derive the spatial correlation lengt h we have de -projected the angular correlation 
function using the Limber transformation (jPeebleslll980l ) and assuming a redshift distribu- 
tion for SMGs. Robust measures of this distribution are limited, in large part because coarse 
angular resolution of sub-millimeter and millimeter maps results in large positional uncer- 
tainties, making counterpart identification for spectroscopic foUowup difficult. The crude 
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Table 1: SMG Clustering Results. 



Catalog 


N 




Fit Type'^ 


/3 








To'' 




f 

ro 








[mJy] 






best 


upper 
limit 




best 


upper 
limit 


best 


upper 
limit 


3.5-a 


189 


4.2 




0.8 


3.7 


11.9 


.015 


10.0 


19.2 


9.6 


18.4 










0.6 


1.2 


4.4 


.018 


9.5 


21.4 


9.0 


20.4 








MC 68.3% 


0.8 
0.6 




5.2 
1.8 






12.1 
12.2 




11.6 
11.7 








MC 99.5% 


0.8 
0.6 




11.5 
4.3 






18.8 
21.1 




18.1 
20.1 


3.0-a 


328 


3.7 




0.8 


1.1 


7.8 


.004 


5.1 


15.2 


4.7 


14.6 
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2.9 


.004 
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16.5 


3.8 


15.7 








MC 68.3% 


0.8 
0.6 




2.4 
0.7 






7.9 
6.8 




7.6 
6.5 








MC 99.5% 


0.8 
0.6 




7.9 
2.8 






15.3 
16.1 




14.7 
15.4 



"Flux limit at 1.1 mm of the catalog 

''For the reduced fit, best fitting results are listed with the corresponding upper limit where Axf, > 1- In 
the case of the Monte Carlo results (indicated by MC), values are percentage of confidence level upper limit 
in the acceptable value of amplitude A^, given the SMG catalog. 

'^Best fits and upper limits to A^,, (in arcseconds^). Lower limits in all cases are zero as explained in the text. 
'^IC values correspond to the best fit power-law. 

'^Correlation length in units h^^Mpc, given our assumption of redshift distribution of Chapman et al. ( 20051 ) 



Using redshift distribution of IChapin et al.l (|2009l ) 



knowledge of the redshift distribution for these galaxies provide an additional source of sys- 
tematic error in the derivation of the spatial clustering. Here we discuss results obtained 
by assuming two redshift distributions which are believed to be representative of SMGs de- 
tected in the same range of far-IR waveleng t hs as the ones considered here. The most widely 
used is the distribution of I Chapman et al.l (l2005l ). compiled from a set of 75 spectroscopic 
redshifts for 850/im-selected SMGs with optical counterparts identified using deep interfero- 
metric radio continuum imaging. This redshift distribution is known to be biased towards low 
redshifts due to the requirement of a radio detected counterpart, so we use the version of this 
distribution which has been corrected for the radio bias. The corrected distribution is well 
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described by a Gaussian peaking at z = 2.3 and a spread of 1.2, ranging from 1 < z < 3.5. 
Using this distribution, we find that the 68.3% confidence level upper limit of consistent 
correlation lengths for SMGs are < 6-8 h~^Mpc and < 11-12 h~^Mpc for the 3.0-cr a n d 3.5 - 
a catalogs, respectively. Results assuming the redshift distrib ution of IChapin et al.l (120091 ) 
produce similar values, which are summarized in Tabled The lChapin et all J2009h redshift 
distribution is generated from a combination of spectroscopic and photometric redshifts for 
1.1-mm dete cted galaxies, so may b e more applicable to this study. The distribution differs 
from that of IChapman et al.l (120051 ) in that it peaks around z = 2.7 and has a high-redshift 
tail to z > 4. We emphasize that these results are upper limits, and therefore the intrinsic 
clustering of this set of galaxies are likely to be lower. 



4. Discussion 

4.1. Comparison to other SMG clustering measurements 

Before comparing our clustering measures with other works it is important to keep in 
mind the limitations inherent in the selection of samples based on an observable property, 
such as flux, as opposed to a physical property, such as luminosity or mass. The term 
"Sub-Millimeter Galaxies" is often used to indicate a category of galaxies (population is the 
term often used in this context) thought to have well specified and somewhat homogeneous 
properties. For example, SMGs are commonly interpreted as massive systems characterized 
by prodigious star-formation rates powered by major merger events. While these properties 
most likely apply to some SMGs of relatively large far-IR luminosity, it obviously is unrea- 
sonable to think that they are generic to any galaxy that is detectable at some wavelengths 
around 1 mm. Firstly, we should remind that galaxies detected at some wavelength with 
some telescope/instrument combination do not, generally speaking, span the same range of 
the far-IR luminosity function or redshift as galaxies from another instrumental configuration 
observed at another wavelength in the sub-millimeter /millimeter spectral region. Lumping 
all such samples as "SMG" believing that they share very similar properties is misleading. In 
other words, the definition of "SMG" as galaxies that are detected at wavelengths crudely in 
the range 500 fim to 1 mm at the sensitivity of current survey facilities, does not result into 
the selection of common physical properties. It is true that, since current ground-based fa- 
cilities working at the popular 850 /im wavelength have limited dynamic range in sensitivity, 
the resulting samples of galaxies at similar redshifts also have similar far-IR luminosity and 
thus, presumably, physical properties. But this is just an "observational accident" that does 
not apply to other sub-millimeter surveys. In general, galaxies detected at 350 /im with 
Hers chel/ SPIKE or at 1.1-mm with ASTE/AzTEC, even if at the same redshift as those 
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observable with JCMT/SCUBA, will cover different portion of the far-IR luminosity func- 
tion, and will generally have different physical properties, such as mass, clustering strength, 
star-formation rate, etc. (we are not addressing here the different sensitivity and redshift 
distribution function of the corresponding samples). 

With this caveat in mind, we can try to compare our results with others. We find 
that at 1.1mm and down to 1.26 mJy the angular clustering of SMGs in the COSMOS 
field is poorly constrained and with our sample size we can only set upper limits to the 
correlation length. Our 68.3% confidence level upper limits to the correlation length from 
the Monte Carlo simulation are < 6-8 h~^Mpc or < 11-12 h~^Mpc, depending on flux limit. 
Generally, our SMG clustering limits are higher for the higher fl ux limit (4.2 mJy). The 
recent prediction from the theoretical model of SMG clustering by lAlmeida et al.l ( 120101 ) for 
Ss50iim > 5mJy is Tq ~ 5 h~^Mpc, which is consistent within the error of our measurements. 
The S'l.imm > 3.7mJy flux limit of the S.O-c r sources roug h ly tra nslates to Ssnoum > 5.9mJy, 
assuming the spectral index suggested by IChapin et al.l (120091 ) of Su oc z/^-^ between 1.1- 
mm and 850-/im sources. Our 1.1-mm sources are likely similar to this simulated galaxy 
distribution. 

In terms of the results of previous studies of SMG clustering, direct comparisons are 
difficult to make because of the differing wavelengths and flux limits of each survey. Poorly 
quantified redshift distributions in all cases further complicate the issue when trying to make 
comparisons. As we found in this paper, the brighter SMGs show evidence for stronger clus- 
tering than the fainter SMGs consistent with what is expect ed from galaxy evoluti on models 



(e.g. Almeida et al. 2010). A similar result is also found by iBrodwin et al.l ( l2008l ) based on 
the clustering of Spitzer-selected ultra-luminous infrared galaxies. This demonstrates that 
caution should be taken when comparing the clustering of different samples of SMGs se- 
lected at different wavelengths down to different depths. If we ignor e these effects, a direct 



comparison makes our upper limits inconsistent with the results of ( IScott et al.ll2006l ). but 



(Webb et al. 


2003; 


Blain et al. 


2004; 


WeiB et al. 


2009 



Maddox et al.ll2010[ ). These comparisons are summarized in Table |2] and Figure |5l Our flux 
limit is generally higher than other studies, when translated to a common wavelength, so it 
may be reasonable to assume the AzTEC sources should be more strongly clustered. 

In a recent paper, Amblard et al. (2011) measured the clustering at 250-500 /im from the 
brightness fluctuations in the power spectrum of Herschel/SPIKE maps after masking out 
the bright, detected sources. Assuming we are probing the Raleigh- Jeans tail of the spectral 
energy distribution, the average flux density ratio is Si,iram/ Ss^o^m ~ 8 (with a spectral 
index of 1.8, Chapin et al. 2009). Given the confusion limit of SPIRE, these fluctuations are 
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Fig. 5. — Correlation lengths and flux limits (translated to 850/im using the spectral index 
of Chapin et al. (2009)) from this and previous studies. 

probing the clustering of sources down to 350 fim fluxes of a few mJy (Amblard et al. 2011), 
which translates to only 0.4 mJy and 0.25 mJy at 850/im and 1.1mm, respectively. This 
limit probes galaxies down to Ljr ~ 3 x 10^^ Lq at 2 ~ 2, much fainter than the typical 
limits of sub-millimeter surveys. It is not clear which part of the far-IR luminosity function 
contributes most to the clustering signal measured by the fluctuations. 

While the Amblard et al. (2011) result provides an interesting constraint on the clus- 
tering of fainter sub-mill imeter-emitting g alaxies, these sources are much more numerous 



than typical SMGs (e.g. ISmail et al.ll2002l ). and are not expected to evolve into the most 
massive elliptical galaxies in the local Universe. The halo masses derived in Amblard et 
al. (2011) are more comparable to those of the less extreme Lyman Break galaxies than the 
bright, detected SMGs. With larger telescopes such as the LMT and CCAT, we will be able 
to individually detect galaxies down to S'l.imm < O.lmJy and measure the clustering as a 
function of luminosity, a strong test of various galaxy evolution models. 



The strength of SMG clustering is an additional test of evolutionary models because 
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it can discriminate between the various formation mechanisms for SMGs. Discriminating 
between merging or cold-mode accretion as the dominant mechanism by which SMGs form 
at high-redshift is of particular interest, an d recent siinulatio ns of each mechanism predict 
different correlation lengths. The model of iDave et al.l ( 120091 ). where SMGs are formed by 
accretion of large amounts of cold gas, predicts a large correlation length (ro ~ 10 h~^Mpc) 
because cold gas accretion should be most influential in the most massive dark matter halos. 
Merger driven scenarios on the other hand predic t a more modest range in correlation lengths, 
between = 5-6 /i^^Mpc ( Almeida et al.ll2010l ). We are not yet at the point where we can 



see distinguishing evidence between the models, but this will also be an important goal of 
larger sub-millimeter observatories. 

Additionally, due to the large uncertainty in our measurement, our results are also con- 
sistent with measurements of weaker clustering from other types of high-redshift starforming 



sources contributing to the cosmic infrared background ( 


Lee et al. 


2006; 


2005; 


Giavalisco & Dickinson 


2001; 


Havashi et al. 


2007; 


Viero et al. 


2009) 



2006; lAdelberger et al. 



20091 ) . Their minimum 



halo masses of ~ lO^^-lO^^Mp and correlation lengths of a bout tq ~ 4-5 h ^Mpc (ILee et al. 



2006 



Porciani fc Giavahsco 2002; Adelberger et al. 20051) are consistent with the masses 



and c orrelation lengths for both bright and faint SMGs (lAlmeida et al.ll2010l ; lAmblard et al. 



20 111 ) . If the underlying sub-mil hmeter galaxy popula tion we detected in this stu dy is weakly 
clustered, as may be implied by lAlmeida et al.l ( 120101 ) and lAmblard et al.l (1201 ll ). it supports 
our conclusion from section 2 that the clustering is too weak to be detected with our survey. 



4.2. Map limitations on measuring clustering 

From a practical point of view, an important question to answer is: what characteristics 
of area and depth should surveys of SMGs have in order to yield robust measures of clustering. 
For example, how much area and down to which flux limit, should a survey with AzTEC reach 
in order to test the hypothesis that SMGs at the bright end of the far-IR luminosity function 
are the progenitors of massive elliptical galaxies, and should therefore be strongly clustered? 
In addressing this question, one needs to take into account the key contributors to the error 
budget of the measures, such as 1) the uncertainty in the redshift distribution, since a wide 
one that covers a large redshift interval washes out the clustering signal due to projection 
effects; 2) the sparse sampling of the underlying SMG population, which determines the shot 
noise in the ACF measures; 3) the large beam size of current observations, which prevents 
one from measuring the ACF at small angular scales where the signal is strongest. 

We have done Monte Carlo simulations to investigate the extent to which these map 
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properties are affecting our ability to measure SMG clustering. Specifically, we measure the 
ACF from realizations of galaxy distributions for which we have defined the intrinsic clus- 
tering, and impose AzTEC-like map properties. The realizations are made by generating 
a log-normal density distribution with an intrinsic ACF, and Po i sson sampling the density 



field according to the methods outlined in iPorciani fc Giavaliscd (120021 ) . The resulting real 



izations are 0.72deg'^ in area and contain on the order of lO'' mock galaxies. To match the 
expected percentage of false detections, we merge the clustered mock set with a set of random 
positions, so that they make up 9%, like the 3.5-0" catalog. We then randomly sample points 
from the realization to match the observed number density of 3.5-cr sources in the AzTEC 
map, where the sampled objects are never closer than one beam size separation, and see how 
their ACFs compare with the intrinsic ACF of the realization. We test intrinsic ACFs which 
are strongly and weakly clustered according to power-laws of = 2.9 and = 0.5, re- 
spectively, where (3 = 0.8. These correspond to values of ~ 9 h~^Mpc and ~ 4 h~^Mpc 
for our assumed redshift distribution function. We again disregard ACF measurements for 
angular separations smaller than twice the beam size. The purpose of this test is to simulate 
the ACF we should expect to observe from a map similar to the AzTEC-COSMOS map, if 
SMGs are intrinsically strongly or weakly clustered galaxies. 

In Figure [6], we show the distributions in the value of w{9 = 100") and w{9 = 250") 
from the simulations for each power-law form we tested, after applying the correction for false 
detection rate. The distributions of the ACF values are very broad as should be expected 
because of the low number of mock galaxies used in each sample, as well as the fraction 
of random positions. If the measured values are corrected for the integral constraint then 
the peak of the distributions at each angular separation match well with the intrinsic value. 
When the distributions are compared to our observation, there is a very small chance of 
spuriously finding the observed value at 6* = 100" for the 3.5-0" sources, about 3% for the 
strong case and 0.75% for the weak case. At larger separations the chance increases to 
31% and 15%, respectively. However, the observed and simulated values for both angular 
separations for the strong clustering case are consistent within their errors. 

We have fit a power-law to the mock ACFs, assuming a fixed P of 0.6 or 0.8. The 
clustering amplitudes we recovered are shown in Figure [TJ along with the upper limits we 
derived from the real AzTEC map. These histograms are essentially probability distributions 
for clustering amplitudes that will be measured from 189 sources in the AzTEC map area if 
the intrinsic population is strongly or weakly clustered. In none of the cases explored here 
is it likely that the intrinsic power-law form will be recovered. The assumed /3 influences 
the shape of the distribution, but the probability always peaks at zero. There is 22% chance 
that fits assuming /3 = 0.8 will indicate zero clustering (24% for /3 = 0.6) even if the 
intrinsic correlation length is = 9 h~^Mpc. If the intrinsic value is = 4 /i-^Mpc, 
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Fig. 6. — Probability distributions for the ACF at 100 and 250" separations from the mock 
catalogs (black histogram). Dashed line is the mean from the mock catalogs, which when 
corrected for the IC agrees well with the intrinsic value (blue line). Histograms are roughly 
Gaussian, with standard deviation indicated by dot-dashed lines. Solid red is the observed 
ACF from the 3.5-cr catalog at those angular separations, with Poisson errors given by red 
dot-dashed lines. Each row are results from an intrinsic power-law form shown in the left 
panel. 

the percentages increase to 53% and 55% assuming (3 = 0.8 and 0.6, respectively. Even 
though the distributions in Figure |6] nicely correspond with the intrinsic value, the power- 
law distributions in Figure [7] peak at zero because the large fluctuations in each realization 
from the small sample size can cause negative values in the ACF. It is not possible to recover 
the intrinsic clustering properties, and it is not possible to differentiate between strong and 
weak clustering. The implications this has for millimeter and sub-millimeter surveys at this 
resolution and sensitivity, or any survey where such a sparse sampling of the underlying 
population is detected, is that the true clustering properties cannot be recovered. 
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Fig. 7. — The distributions of clustering amplitudes from fitting power-laws to ACFs from 
clustered simulated maps. Top panel is results from strong intrinsic clustering (A^ = 2.9), 
and bottom is from weak clustering [A^ = 0.5), where (3 = 0.8. Green hatched histograms 
are Ay^ distributions when fitting with an assumed P = 0.8, and in gray are assuming /3 = 0.6. 
Red lines are the 68.3% upper limits in A^i from the fit to the ACF of the 3.5-0" AzTEC 
sources, assuming /3 = 0.8 (solid red), and /3 = 0.6 (dot-dashed red). 
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4.3. Predictions for future surveys 



Our sensitivity to tlie clustering signal in this study is determined by two things. First, 
the number of sources, which depends on the area mapped and the depth, must be large 
enough to overcome the high shot noise stemming from the sparse sampling of the underlying 
galaxy distribution by the SMG selection. Second, the ability to measure small-scale sep- 
arations between galaxies, which depends on the beam size. Not surprisingly, we generally 
found that the probability to recover the intrinsic ACF increases slightly with decreasing 
beam size, however the intrinsic value for a sample size such as ours was still not the most 
likely to be observed down to a beam size of 5". Increasing the number of detected galaxies, 
which can be achieved by increasing survey sensitivity or survey area, provides the largest 
improvement. Fortunately, millimeter and sub- millimeter facilities are advancing and future 
studies of SMG clustering will benefit from increased sample sizes and improved angular 
resolution. Thus the real question becomes, what resolution and survey area will be neces- 
sary to get an accurate measure of SMG clustering? Using the strongly clustered simulation 
discussed in the previous paragraph for an ASTE-COSMOS sized map, we have estimated 
the limiting galaxy sample size (as a function of beam size) for which it is possible to recover 
the intrinsic clustering. A measurement of the clustering is considered to have recovered the 
intrinsic value if, after sampling the parent realization 2000 times and fitting the ACFs, the 
value of the intrinsic clustering amplitude lies within the standard deviation of the cluster- 
ing amplitude distribution. Additionally, the distribution must satisfy the requirement that 
the most likely value of clustering amplitude in the distribution also lies within the stan- 
dard deviation. This second condition rejects the types of distributions shown in Figures 
m and [71 The resulting 'region of robust recovery' is shown in Figure El We have added 
the approximate positions of previous surveys which have made clustering measurements, 
assuming the estimated number density of detected sources in each is constant. These place- 
ments indicate that previously measured ACFs, even the Herschel surveys with large area 
[16 square degrees], are still compromised by large beams and low sensitivity. The previous 
study by Scott et al (2006), measured from a combination of multiple SCUBA fields, falls 
barely within the robust region. However, one caveat of this simulation is that it assumes 
a contiguous map region. In this case, the signal to noise of the ACF measurement, which 
depends on the number of independent galaxy pairs, DD, is related to the total number of 
detected galaxies N by DD = 0.5N{N — 1). For discontiguous fields, galaxy pairs between 
fields cannot contribute and so the number of galaxy pairs is lower, given the same number 
of detected galaxies. For discontiguous identical fields, the number of pairs goes down by 
a factor of F where F is the number of fields. Thus, measurements made from galaxies in 
multiple fields will inevitably have lower signal to noise than a measurement using the same 
number of galaxies from a contiguous area. 
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Fig. 8. — Galaxy sample number needed to recover an intrinsic clustering with power- 
law form w{6) = 2.96'~°'^, as a function of beam size. Area up and left from the contour 
line indicates region where values may be recovered. Right-side axis indicates survey area 
required at this sensitivity to robustly measure the clustering of the 5*1, > 4.2 mJy sources 
investigated here. Some positions of previous surveys have been provided where possible, 
based on the source number density of their survey. 

The region in Figure [8] illustrates the difficulties in measuring the angular clustering of 
bright SMGs such as the S'l.imm > 3.7 mJy samples explored here, but additionally provides 
some guidance for future surveys which will aim to robustly measure the clustering of SMGs. 
Upcoming surveys with the Large Millimeter Telescope for example, with its beam size of 6", 
will be able to make robust measurements for these galaxies with a mapped area of about 
two square degrees. These future results will no doubt provide exciting discoveries about the 
parent population of sub-millimeter sources. 
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5. Summciry 

1. We have measured the angular clustering of SMGs detected at 1.1mm from the 
largest contiguous map at that wavelength to date. We have studied sources detected at 3.5 
(3)-cr with flux limits 5*1.1111111 > 4.2(3.7) mJy. The power-law flts are poorly constrained due 
to large uncertainties in the ACF. 

2. We have set upper limits to the spatial correlation lengths for these galaxies. For 
flux limits 5*1. imm > 4.2 mJy, we flnd tq < 11-12 h~^Mpc, and for 5*1. imm > 3.7 mJy we flnd 
ro < 6-8 h-'^Mpc. 

3. We have shown that for simulated clustered samples, our map properties, speciflcally 
survey area, depth, beam size, prevent us from accurately measuring strong clustering (e.g. 
with ro ~ 9 /i^^Mpc). 

4. We have used these simulations to predict the conditions under which future surveys 
may robustly detect clustering. Specifically, to measure clustering galaxies detected with 
'S'l.imm > 4.2 mJy and mapped to a depth of 1.26 mJy/beam, we will be able to robustly 
measure clustering with an area of ~ 2 square degrees, with the LMT's beam size of 6". 

The authors wish to thank Sara Salimbeni, and Paolo Cassata for useful discussions, 
and Dan Popowich for technical support. We also thank Asantha Cooray for valueable 
comments. Support for this work was provided in part by the National Science Foundation 
grants AST-0838222 and AST-0907952. 
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Table 2: SMG Clustering and Flux Limits. 



N 

^ ' sources 


To ± 


A 


Si/ 




beam size 


Reference 




[ /i-^Mpc] 


[/im] 


[mJy] 


[mJy] 


[arcseconds] 




27 


12.8 ±7.0 


850 


3.0 


3.0 


14.5 


Webb et al. 2003 


47 


6.9 ±2.1 


850 


5.0 


5.0 


14.5 


Blain et al. 2004 


51 


31 ^ 


850 


5.0 


5.0 


14.5 


Scott et al. 2006 


126 


13 ±6 


870 


4.6 


4.8^ 


19.2 


Weiss et al. 2009 


1633 


7-11 


350 


36 


7.3^ 


17 


Maddox et al. 2010 


189 


<11-12= 


1100 


4.2 


6.7^ 


28 


This study 


328 


<6-8^ 


1100 


3.7 


5.9^ 


28 


This study 



"Only angular clustering was published bv lScott et all (|2006l ). we tra nsform their pow e r-law result and errors 
for their sources above S/N of 3.5, using the redshift distribution of IChapman et all (|2005r ) 
*Flux density translated assuming Si, oc v^'^ 

'^We have quoted our 68.3% confidence level upper limits for comparison. 



